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Abstract 

It has been shown lately the optimality of uncoded transmission in estimating Gaussian 
sources over homogeneous/symmetric Gaussian multiple access channels (MAC) using multiple 
sensors. It remains, however, unclear whether it still holds for any arbitrary networks and/or with 
high channel signal-to-noise ratio (SNR) and high signal-to-measurement-noise ratio (SMNR). In 
this paper, we first provide a joint source and channel coding approach in estimating Gaussian 
sources over Gaussian MAC channels, as well as its sufficient and necessary condition in restor- 
ing Gaussian sources with a prescribed distortion value. An interesting relationship between our 
proposed joint approach with a more straightforward separate source and channel coding scheme 
is then established. Further comparison of these two schemes with the uncoded approach reveals 
a lacking of a consistent ordering of these three strategies in terms of the total transmission 
power consumption under a distortion constraint for arbitrary in-homogeneous networks. We 
then formulate constrained power minimization problems and transform them to relaxed convex 
geometric programming problems, whose numerical results exhibit that either separate or un- 
coded scheme becomes dominant over a linear topology network. In addition, we prove that the 
optimal decoding order to minimize the total transmission powers for both source and channel 
coding parts is solely subject to the ranking of MAC channel qualities, and has nothing to do 
with the ranking of measurement qualities. Finally, asymptotic results for homogeneous networks 
are obtained which not only confirm the existing optimality of the uncoded approach, but also 
show that the asymptotic SNR exponents of these three approaches are all the same. Moreover, 
the proposed joint approach share the same asymptotic ratio with respect to high SNR and high 
SMNR as the uncoded scheme. 



1 Introduction 

Recent years have witnessed a tremendous growth of interests in wireless ad hoc and sensor networks 
from both academia and industry, due to their ease of implementation, infrastructure-less nature, 
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as well as the huge potentials in civil and military applications. In many instances, sensor nodes 
are deployed to serve a common purpose such as surveillance and monitoring environments. One of 
the major design metrics is to maximize the lifetime of a sensor net while meeting the constraints 
imposed by the quality of reconstruction, such as the resulting ultimate distortion measure when 
data collected by sensors are fused to construct an estimate of the monitored source. A critical factor 
affecting the lifetime of a sensor net is the amount of total power expenditure that senor nodes spend 
on transmitting their measurements to a fusion center. The power consumptions are closely related 
with the way sensors collecting and processing measurements, as well as the communication link 
quality between sensor nodes and the fusion center. 

In this paper, assuming L sensor nodes send measurements of a Gaussian source to a fusion 
center via a one-hop interference limited wireless link, we investigate the issue of power allocations 
across sensors with and without local compression and channel coding. A similar system model for 
Gaussian sensor networks has also been adopted recently by and 3. In [J , the authors 

investigated joint source-channel coding paradigm and analyzed how distortion scales with respect 
to the size of the network. They showed that uncoded transmission achieves the lower bound of 
the mean squared error distortion as the number of sensors grow to infinity in symmetric networks. 
However, no exact source and channel coding schemes are provided for general system settings other 
than the uncoded scheme. In j^, the "exact" optimality of uncoded transmission is proved even for 
the homogeneous Gaussian networks with finite number of sensor nodes. As pointed out in j^, it 
remains unclear though what approach is more favorable when a system becomes non-symmetric 
with a finite number of sensors. 

The objectives of this paper are two folds. First, we will propose a joint source-channel coding 
approach and then establish its relationship with the separate source and channel coding strategy. 
Second, we will investigate the optimal rate and power allocation strategy in order to minimize 
the total transmission power under the constraint that the mean squared error value in estimating 
the Gaussian source remotely is no greater than a prescribed threshold. In particular, we will 
compare the resulting total power consumptions of three distinct processing schemes, namely, joint 
source and channel coding, separate source and channel coding and uncoded amplify- and-f or ward 
approaches for in-homogeneous networks, and demonstrate the well known result of the optimality 
of uncoded approach for estimating Gaussian sources in homogeneous networks does not always hold 
in inhomogeneous networks. 

Our contributions in this paper can be summarized as follows: 

• A joint source and channel coding approach is proposed, whose achievable rate region is ob- 
tained. An interesting relationship between this approach and separate source and channel 
coding approach is then established. 

• Optimal decoding order for both joint and separate source channel coding is found which is 
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only a function of MAC channel ranking order, and has nothing to do with the power level of 
source measurement noise, when we intend to minimize the total transmission power. 



• Relaxed geometric programming problems are formulated in order to compare three schemes. 
Numerical results demonstrate the uncoded transmission is not always the best option. The 
ordering of the three schemes is highly dependent on relative channel qualities, measurement 
noise levels, as well as the distortion threshold. 

• Asymptotic results for large size homogeneous networks with finite SNR and SMNR are ob- 
tained which show the optimality of uncoded transmission from another perspective. More 
importantly, a condition is found under which the scaling factor of received channel SNR ver- 
sus signal-to-distortion-noise-ratio (SDNR), as SMNR grows to infinity, of joint approach is 
equal to that of the uncoded scheme. In addition, we prove the SNR exponents of all three 
schemes are the same. 

The paper is organized as follows. System model is set up in Section [2l A joint source and 
channel coding scheme is proposed in Section [U in which we establish its achievable rate region, as 
well as its relationship with the separate source and channel coding scheme. In order to compare 
joint and separate approaches, the formulated total power minimization problems are solved using 
geometric programming approach in Section HI where we also obtain the optimal decoding order 
for non-homogeneous networks. Uncoded approach is revisited in Section [5] from the perspective of 
making comparisons with the former two approaches. In Section [U we compare the aforementioned 
three schemes in asymptotic region for homogeneous networks. Finally, numerical results of our 
comparisons for arbitrary two-node networks are presented in Section [7l 

2 System Model 
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Figure 1: System model 
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Assume L sensor nodes observe a common Gaussian source Xo[i],i = 1, • • • ,n, where Xo[i] ~ 
AA(0, cr|) are identically and independently distributed Gaussian random variables with mean zero 
and variance cj|. The measurements Xj\i] = XQ[i] + Nj[i], j = 1, - ■ ■ , L from L sensors experience in- 
dependent additive Gaussian measurement noise Nj[i] ~ J\f{0, aj^,), where independence is assumed 
to hold across both space and time. Let Yj[i] denote the transmitted signal from sensor j at time i, 
which satisfies an average power constraint: 

1 " 

-Y.\YM'<Pj,J = l--- ,L. (1) 

1=1 



The processed signals {^ [i]} then go through a Gaussian multiple access channel and are super- 
posed at a fusion center resulting in Z[i] = J2j=i y/dj^jli] + W[i], where W[i] ~ M{0, Oy^) are white 
Gaussian noise introduced at the fusion center and assumed independent with Nj[{\. Coefficients 
gj,j = l,--- , L capture the underlying channel pathloss and fading from sensors to the fusion center. 
In this paper, we assume coherent fusion is conducted in the sense that gj are assumed perfectly 
known by the fusion center. Upon receiving {Z[«]}, the fusion center constructs an estimate {Xo[i]} 
of {Xo[i]} such that the average mean squared error De = lim.„_»oo ^ X]"=i ^ — ^o[^] of the 

estimation satisfies De < where D is a prescribed upper bound for estimation error. 

What interests us in this paper is power efficient schemes to estimate the Gaussian source remotely 
with a prescribed mean squared error. Three approaches, namely, joint source and channel coding, 
separate source and channel coding, and uncoded amplify-and-forward schemes, will be investigated 
in the sequel. 



3 Joint Source- Channel Based Fusion and Its Relationship with Sep- 
arate Source and Channel Coding 



In [5|], a joint source and channel coding scheme is proposed to estimate two correlated Gaussian 
sources remotely at a fusion center where measurements from two sensors are received through a 
Gaussian MAC channel. Achievable rate region was obtained as a function of the required distortion 
tuple in restoring two correlated sources. Inspired by their work, we, in this section, will first 
develop an achievable rate region for our proposed joint source-channel coding (JSCC) approach for 
any arbitrary network with L > 1 sensor nodes and then demonstrate an interesting relationship 
of JSCC with a separate source and channel coding scheme (SSCC) which is a straightforward 



combination of the recent findings on CEO problem ja| and traditional MAC channel coding [7|| with 
independent sources. 
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3.1 Achievable Rate Region for Distributed Joint Source-Channel Coding in 
Estimation of Gaussian Sources 



Let Rj,j = I,-"" denote the compression rate at the j-th sensor. There are total 2"^-' source 



codewords Uj = {U^''\ k = 1, - ■ ■ , 2"^^ } from which sensor j selects U^- 



Jnij) 



(mj) 



1,1 



,n} 



to represent Xj = = 1, • • • ,n}. The joint approach we propose here is to let each sensor 

directly transmit a scaled version of a source codeword \J^J^'\ The scaling factor introduced herein is 
to maintain the average transmission power Pj by sensor j , j = 1, ■ ■ ■ , L. Since L sensors see the same 
Gaussian source with independent measurement noise, L quantization vectors {U^™^ \ j = !,••• , L} 
are correlated. As a result, the decoding at fusion center needs to take into account of such correlation 
when it performs joint decoding of these L codewords. The decoded source/channel codeword uj™^^ 
are then linearly combined to obtain an MMSE estimate {Xol^li ^ = !)■■■ ,n} of the Gaussian source 
{Xo[i\,i = !,••• ,n}. 

We are interested in deriving the achievable region of rate tuples {Rj,j = 1, • ■ ■ ^L} such that 
2nRj^j _ 1^... source/channel codewords can be decoded with asymptotic zero error and the 
mean squared error De satisfies De < D. 

Theorem 1. To make De < D, Ri satisfy 



Ri = I{Xi-Ui) =ri + -log 



1 + ^(1-2-2-") 

C AT. 



, i = l, 



.L 



(2) 



where > 0, i = 1, • • • ,L are chosen based on 



1 1 A 1 - 2-2^'= 1 



fc=i 



D 



(3) 



and I{Xj; Uj) denotes the mutual information between Xj and a Gaussian random variable Uj, which 
is associated with Xj by 



Uj=Xj + V„j = !,■■■ ,L 



(4) 



where Vj, independent of Xj, are independent Gaussian random variables with mean and variance 



Proof. The proof is a straightforward application of the techniques used in proving Lemma 10 in 
For brevity, we only provide an outline here. 

We quantize with 2"^^^ Gaussian vectors such that the source symbol Xj[i] can 

be constructed from the quantized symbol through a test channel Xj[i] = Uj[i] + Vj[i], where 
Vj[i] is a Gaussian random variable with mean zero and variance which is independent of 

Uj[i] ~ M (o, (1 - 2~'^^i)ax^^ . Equivalently, we can also represent Uj[i] as Uj[i] = aXj[i] + Vj[i], 
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where a is the hnear-MMSE estimate coefficient and Vj[i] is the resultant estimation error. By 
orthogonal principle, we have 

air V 

and Vj[i] is a Gaussian variable independent of with mean zero and variance 2~^^J ^1 — 2"^^^ j aj^ 
Therefore, after normalization, we obtain 

Uj = l^U,=X, + ^V,=X, + V, (5) 



where Vj M (O, cr^,/(2^'^J ~ ^) ) • introduce variables rj such that 



.2 //'o2fl, 

_ ^ ^^2r, _ 1) ^ (6) 

which proves We can also see that rj is actually the conditional mutual information between Xj 
and Uj given Xq, i.e. Vj = I (Xj; UjIXq). Since I{Xj; Uj) = H{Uj) — H{Vj), it is then straightforward 
to show that ^ holds. 

Given Uj = Xj + Vj and Xj = Xq + Nj, where Nj and Vj are independent, we can construct the 
LMMSE estimate of Xq by Xq = Yl!j=iPj^3^ where coefficients f5j can be determined again using 
Orthogonal Principle. Based on Equations (95) and (96) in we obtain the desired result for the 
mean squared error in ([3]). 

□ 

From the proof of Theorem [H it can be seen that Ui and Uj are correlated due to the correlation 
between Xi and Xj, whose correlation can be captured by pij, the covariance coefficient between Xi 
and Xj, which can be computed as 



E[X,Xj] ^ a| 



The covariance coefficient pij between Ui and Uj can be obtained accordingly as: 



~Pi,=Pi,^{l-2-^R^){l-2-^R.). (8) 
After substituting Ri determined in Theorem [1] into it, we obtain 



~ [I + + q^y 
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where qi = -^(1 - 2 ^'"'). 

For any given subset S C {1,2,- •• define vectors U(S) 



^TTl 5***1 



|S| 



and U(S" 



where vr is an arbitrary ordering of the L indexes. The covariance matrix of 



U = [U(S), U(S'^)]-^ can thus be decomposed as 



E [UU^ 



(10) 



where Ss, ^s", ^s,s= denote the auto- and cross-covariance matrices of U(S) and 11(8*^). The 
entries of Slu are i^v)ij = Pi,j\/PiPj for i / j and (Su)j_j = i^j, i, j G {1, * * * > where pij is 
obtained in (19]). 




IS 



After each sensor maps the observation vector to C/j, an additional scahng factor 7j = 

imposed on C/j, where cr'ij. = cr'x./ — 2"^^^^ in order to keep the average transmission power of 
Yj[{\ = ^jUj[{\ as Pj. The received signal at the fusion center can thus be written as 



Theorem 2. Given the received signal Z[i],i = ,n in j77]) . the quantization rate Ri,i 
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1, 



, L obtained in Theorem U\ satisfies the following inequalities in order to restore Xq at the 



fusion center with distortion no less than D: 



\S\-l 



R{s) < / f/..;t/;j:i +I{U{Sy,U{S^)) 



i=l 



+iiuisy,z\uis')),\fsc{i,... ,L}, 



(12) 



where R{S) = YlieS ^-i' ^i'^) — ^ '5}, is the complementary set of S, {tti, • • • ,7r|s|} is an 

arbitrary permutation of S, and Ut^I^I = ^^TTi+i, * * * , 

Proof. The proof follows the footsteps of the one proving achievability of the capacity region for 
regular MAC channel with independent channel codewords. The difference here is that we need 
to take into account the correlations of the channel inputs from each user when the joint typical 
sequence technique is used to compute the upper bound of the probability of various error events. 
The details are deferred to the Appendix El □ 

It can be easily seen that when inputs to the channel are independent, the first and second terms 
in (fT2]) vanish and consequently the inequality reduces to the one characterizing the capacity region 
for MAC channels with independent inputs [7], Chap. 15.3]. 
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Next, we prove a sequence of lemmas in order to establish a connection between the JSCC and 
SSCC approaches. 

Lemma 1. Given Uj = Xj + Vj,j = 1,2 as in U2 — > X2 — > Xi Ui forms a Markov chain. 
As a result, we have 



I{U2;X2\Ui) = I{U2;X2) - I{Ui;U2) (13) 
I{Ui;Xi\U2) = I{Ui;Xi) - I{U2;Ui) (14) 
I{Ui, U2]X,,X2) = I{Xi- Ui) + /(X2; U2) - I{Ui; U2) (15) 

Proof. See Appendix [B1 □ 
Lemma 2. For Uj = Xj + Vj , j = 1, ■ ■ ■ ,L, the following relation of mutual information holds 

\S\-1 

I (C/(5); X(S)) = X^) - I {U^.,UZ[) , V5 C {1, . . . , L}. (16) 

ies 1=1 

Proof WLOG, consider 5 = {1, 2, • • • , s}. Define U2 = [U2, t/3, • • • , Us] and X2 = [X2, X3, • • • , X,]. 
Apparently, U2 X2 Xi ^ Ui forms a Markov chain. Prom (fTSl) . we immediately obtain: 

/[C/(5);X(5)] =/(Xi;C/i) + /(U2;X2)-/(C/i;U2) (17) 

Using same idea, it can be shown that 

/(U2;X2) = /(C/2;X2) + /([/3-- - ,Us;X3,--- , X,) - /([/s; [/|), (18) 

where I{U3 • • • , C/^; X3, • • • , Xs) can be decomposed in a similar manner. Such decomposition can 
be conducted iteratively until we reach 

I{Xs-i,Xs; Us-i, Us) = I{Xs^i; Us-i) + I{Us; Xs) - I{Us; C/._i) (19) 

Combining all iterations yields the desired result: 

s s—1 

I (UiS); X{S)) = ^ I{Uf,X,) - I {Ui-, t/^Vi) (20) 



4=1 i=l 



As the whole derivation does not rely on the exact order of {1, • • • , s}, we thus complete the proof 
of Lemma [2j □ 
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Lemma 3. For the same U{S) and X{S) as in LemmaUl we have 



I (f/(S); XiS)) - I ([/(5); = / [UiS); X{S)\U{S^)] 

= I piS) ; Xo \UiS^)] + ^ / [C/i ; Xi\Xo] 



Proof. See appendix 



(21) 



□ 



Theorem 3. When each sensor performs independent vector quantization and subsequently transmits 
the resulting scaled quantization vector through a Gaussian MAC channel, to reconstruct the Gaussian 
source at fusion center with distortion no greater than D, the necessary and sufficient condition is 
for any subset S C {1, 2, • • • , L} , the following inequality holds 



where 



and 



I [U{S)- Xo|[/(5'=)] + Y,I [Uv, X,\Xo\ < I [U{Sy, Z\U{S')] 



LHS= --log 



^5 "N., 



(22) 



(23) 



RHS = i log j 1 + ^^(5)^Qss Vg(S) 



(24) 



with ^yg{S)'^ = [^/gi,i E S] and Qsg = — Ss,s<=Ss= "'^^s<=,s- The auto- and cross- covariance 
matrices Ss, 5]sc, 5]s,s<= o,nd Xlsc^s o,re defined as in ^10^) . 

Proof. To construct an estimate of Xq at a fusion center with distortion no greater than D is 
equivalent to requiring that the minimum compression rate Ri satisfies Ri = I{Xi; Ui), as required 
by local vector quantization, and that Ri,i = 1, • • • ,L are in the region determined in Theorem [21 
Consequently, the conditions are translated to 



i=l 



+1 (UiS); UiS")) + / iuisy,z\uis'')) 



(25) 



From Lemma [2] and Lemma [HI this condition is equivalent to 



/ [uisy,Xo\uis')] + [u,;Xi\Xo] < I [uisy, Z\UiS')] 



(26) 
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Define ri = I {Xf, Ui\XQ). Then it is straightforward to show that the LHS of (f26l) is equal to that in 
([23]) by computing the mean squared error of estimating Xq using U{S) or [U{S), U{S'^)] Q], which 
is 



E[\X^?\U{S)\ 



^5 7fs 



(27) 



Given I [U {S); Z\U {S")] = H[Z\U{S^)] - H [Z\U{S),U{S^)] and U and Z are Gaussian ran- 
dom vector/variables, it is sufficient to get the conditional variance of Z given the vector U{S^). 
This can be boiled down to finding the conditional variance of YliieS ^/dlli^i given U{S^) as Z = 

Etl V9ll^U^ + W. 

Based on Theorem 3 in [81], we have 

Gov [U{S)\U{S')] = - Ss.scl^sc-'l^sc.s (28) 

Therefore, 



Var ( ^ 

\ie5 



f9ll^U^ = ^(5)^Qss Vg(5). (29) 



The entropy can thus be computed accordingly yielding 

H [Z\U{S-)] = \ log [27re (a^, + v^(5)'^Qss^(5))] 

B [Z|[/(5^), ?7(5)] = \ log (27rea2,) (30) 

which leads to ([241) . and hence completes the proof. □ 

3.2 Relationship With Separate Source-Channel Coding Approach 

If we look closely at (f22l) and ([231) . we can easily see that the LHS of the achievable rate region 
for the JSGG approach actually characterizes the rate-distortion region for Gaussian sources with 
conditionally independent (GI) condition Q]. 

Under the GI assumption, distributed source coding at sensors includes two steps. The first step 
is the same as in JSGG, in which an independent vector quantization for Gaussian source at each 
sensor is conducted with respect to the observed signal Xj = = !,••• ,n}, which generates 

a vector Uj' = {f7j^[z], i = 1, • • • , n}, G {1, • • • , 2"-^^}. In the second step, those indexes of kj are 
further compressed using Slepian-Wolf 's random binning approach 0,0]. Gonsequently, there are 
c^nRj |-,jj^g £qj. sensor j, which contain all representation vectors Uj' of measurements Xj. It was shown 
in that satisfy: EjgS ^ ^ \U{S);X^\U{S% + Y.i^s ^ MM. for all 5 C {1, 2, • • • , L} 
in order to restore X remotely with distortion no greater than D. 
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For SSCC, to send indexes of bins correctly to the fusion center, independent Gaussian codewords 
{Yj[i] ~ J\f{0,Pj),i = jTi} for J = 1, • • • for each bin index are generated at L sensors. 

To ensure indexes are correctly decoded at the fusion center, the rate tuple {Ri,i = ,L} 
should also be contained in the capacity region of Gaussian MAC channel with independent channel 
inputs under power constraints {Pj,j 



1, • • • ,L}. The region is characterized by Xlies 



ilog 1 + ^^^^^ ,for all5C{l,2 

The data processing at the fusion center consists of three phases. In the first phase, channel 
decoding is performed to recover the indexes of bins containing = 1, • " " i-^}- In the second 

phase, joint typical sequences {Uj'} are obtained from L bins whose indexes are restored. In the last 
phase, {Uj'} are linearly combined to estimate the source vector {Xo[i]} under the minimum mean 
squared error (MMSE) criterion. 

Under SSCC, we can therefore obtain the sufficient and necessary condition for restoring Xq with 
MSE no greater than D: 



< - log 
- 2 ^ 



a 



w 



,V5C{1,2,... 



(31) 



In general, we cannot say which approach, JSCC or SSCC, is better in terms of the size of rate 
region. This can be seen more clearly when we look at a particular case for L = 2. When there are 
only two sensors, to reconstruct {Xo[i]} with a distortion no greater than D using JSCC or SSCC 
proposed as above, the transmission powers Pi and P2, as well as ri and r2 satisfy: 



n-^log^ + ^(l-2--) 
<2l°Ml + ^2 1 



(32) 



1 , De De 

^^2 - 77 log < — + ^- 



w 



1-2 



-2ri 



< 



^1, (, , P2g2(l-Pf,2) 

n + r2 + -log ^— 

2^°H' + ^ ) 



(33) 



(34) 
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where pi^2 denotes the covariance coefficient between Ui and Uj^ which is zero for SSCC and 



^i'2 = W jT-. wTT- r, (35) 

for JSCC, as obtained in ([9]) with satisfying 

•3 fc = l J^fc 

It can be easily seen from (f32]l - ([Ml) that inequahties of ([32]) and ([331) under JSCC are domi- 
nated by those under SSCC, i.e. {Pj,rj} satisfying (f32l) and ([33]) under JSCC also satisfies the 
corresponding inequalities under SSCC, while the inequality (f34l) under JSCC dominates that under 
SSCC. 

To compare SSCC and JSCC, we next formulate a constrained optimization problem in which 
the objective is to minimize the total transmission power of L sensors with a constraint that the 
distortion in restoring X is no greater than D. For L = 2, the problem can be stated as 

min P1+P2, subject to (I32])-(I33) and JM]). (37) 

Pi,ri,i=l,2 

which becomes power/rate allocations for SSCC and JSCC, respectively, for difi'erent correlation 
coefficients p. The optimization results for SSCC and JSCC under different channel and measurement 
parameters will reveal to us the relative efficiency of SSCC and JSCC, which will be further compared 
with that for an uncoded scheme, as investigated in the next few sections. 

4 Optimal Power and Rate Allocations to Minimize the Total Trans- 
mission Power 

4.1 Geometric Programming Solution to Power/Rate Allocations 



The constrained optimization problems in (1371) are non-convex. They can, however, be solved effi- 
ciently using standard techniques in convex optimization by transforming the original problems into 



relaxed convex geometric programming problems 10|] 



In this section, we take SSCC as an example to demonstrate how it works. For SSCC with 
p = 0, the rate tuple {Ri,R2) should be taken from the boundary of the capacity region for two-user 
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Gaussian MAC channels to minimize Pi + P2. Consequently, 



Ri = - log ( + log 1 + 



2 

P2 = |log( 1 + 



a 



w 



P292 



91 Pi + (^w 



52^2 + 



cr 



w 



(38) 



where a G [0, 1] is a time sharing factor. 

Define yj = 2^^^ , zj = 2^^'j for j = 1,2. We can transform this total power minimization problem 
for SSCC with L = 2 to an equivalent generalized Signomial Programming problem |lll |: 



min Pi + P2, subject to: 

Pj,yj,zj,j=i,2 



{92P2 + a'wf-''\i < 
(1 + ^) {9lPl+92P2 + <yw) 



{giPi + aw) 2/2 < 



1 + 



92P2^^'-^ 



a 



w 



2 



2 \" 



2^2 



+ 



2^2 



+ D 



Vl ^92 ^ZiZ2<tI/D < 1 



_1 '^s^N 



Hy2^^2 < 1 



(39) 
(40) 

(41) 
(42) 

(43) 

(44) 
(45) 



where constraints (|40)) and (|4T|) are obtained by relaxing equality constraints in (l38|) . and constraints 
([42]) - (l45ll result from the transformation of (l32]) - (f34ll . which are in the form of f{x) < 1, where f{x) 



is a posynomial function of n variables [lO||: f{x) = X^^i c^x^^'^Xg^* •••x^"*, where > and 
Xj > for j = 1, ■ ■ ■ , n and OyG 7^. In addition, constraints (j40]) and ([4T]) are in the form of 
generalized signomial functions (ill. Il2l| w ith fractional powers. 

Single condensation technique ljl2| can then be applied to convert this Signomial programming 
problem to a standard geometric programming (CP) problem. In this method, we replace (1 + 



PiQi/uy^) in the RHS of (|40]) by its geometric mean ( 0-^^/12 ) ' ^'^^ similarly (1 + P2g2/o'w 



in the RHS of gU by /?^f f^^)^'', where > and + /3i,2 = 1 for i = 1,2. In addition. 



we also replace {giPi + 92-^2 + ^vi/) by its geometric mean: ( ^ 



71 



72-j^' (^)^', where 7i>0 



91 



and 7i = 1. Finally, to handle fractional powers in the LHS of ([40]) and (j4T]) . we introduce 

two auxiliary variables ti and ^2 to replace giPi + o"^ and (72P2 + c^, respectively, in the LHS of 
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([40]) and ([4T]) . Accordingly, two additional posynomials are introduced on the list of constraints: 
QiPi + a^ < ti for i = 1,2. 

The resulting standard geometric programming problem can thus be solved in an iterative manner 
by repeatedly updating normalization coefficients Pij and 7^, and applying interior point method for 
a given vector of these coefficients 

m. 

Using the similar method, we can also transform the optimization problem for the JSCC approach 



to a convex Geometric programming problem. The details are skipped here (l3|] 



4.2 Optimal Source/Channel Decoding Order for Non- Symmetric Channels 



Although the optimization problems formulated in (l37l) can only be solved algorithmically, we can 
still manage to obtain some insights by scrutinizing the problem structures. In this section, we will 
reveal some relationships between the optimal decoding order and channel attenuation factors for 
non-symmetric networks. 

4.2.1 Separate Source and Channel Coding 

We first show the optimal source encoding/decoding order, as well as channel decoder order for 
SSCC is uniquely determined by the ordering of channel attenuation factors {gi,i = !,••• ,L}, and 
has nothing to do with the ranking of sensor measurement noise power {c^. ,i = 1, - • • , L}. 

Theorem 4. For SSCC, let it* denote any permutation 0/ {!,••• ,L} such that < 9tt*{2) ^ 

■ • • ^ 9-K*{L)- To minimize the total transmission power, the optimal decoding order for channel codes 
at receiver is in the reversed order of it* , i.e. interference cancellation is in the order ■k*{L),-k*{L — 
1), • • • ,7r*(l), which is also the decoding order of distributed source codewords. 

Proof. The proof consists of two steps. First, we will determine the channel decoding order for a 
given vector of source encoding rates {Ri,i = 1, • • • ^L}. For SSCC, the rate tuple {Ri} satisfies 
ZliGS log (1 + Y^idS Pi9i/crw) ' which is equivalent to 

5^ X, > f{S) ^ n 2'''' - 1, V5 C {1, . . . , L}. (46) 

where Xi = Pigi/a^y and / : 2^ — > TZ^ is a set function with E = {!,••• ,L}. Given {Ri}, the 
optimization problem then becomes min'^f^^ a^^Xi/gi, subject to ([46]) . 

Based on the Corollary 3.13 in (3], the set of power vectors {Xi} satisfying ([46]) is a contra- 
polymatroid g{f), as / satisfies (1) fU) = (2) f{S) < f{T) if S C T (3) f{S) + f(T) < 
f{SUT)+f{SnT). From Lemma 3.3 in [1J|, the minimizing vector {Xi, i G E} for min X^^^ a^,Xi/gi 
is a vertex point {X^.(j)} of ^(/), where vr* is a permutation on the set E such that l/gTT*{i) > • • • > 
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l/97r*(L) and 

W = / • • • , - f ({vr*(l), • • • , vr*(z - 1)}) 

i i—1 

= Y[ 2^-^-* (J) - Y[ 2^^^*^^) (47) 

which thus proves the first part of this Theorem. 

We next use ([471) to transform the objective function to 

For SSCC, rate tuples {Ri,i = l,-" ,L} satisfy J2ieS > H^iS); U{S)\U{S^)). It is therefore 
quite straightforward to show that in order to minimize the total transmission power in (|48p . we 
need to have X]j=i achieve the lower bound, i.e. 

which implies that the decoding order for source codewords is vr*(L), • • • ,7r*(l), independent of the 
ordering of variances {cTVi, i = 1, • • • , L} of measurement noise. 

□ 

Theorem IHimplies that sensor vr*(L) does not conduct random binning and its quantization vector 
{^7r*{L) W} is restored first. {f^7r*(L)N} is then used as side information to restore sensor 7r*(L — l)'s 
source codeword {t/7r*(L-i) [^]} from this node's first stage Gaussian source vector quantization, which 
resides in the bin whose index is decoded from the channel decoding step. This process continues 
until sensor 7r*(l)'s first stage quantization vector {?7^.(i) [n]}is restored by using all other sensors' 
quantization as side information. 

In the next section, we will see a similar conclusion can be reached for JSCC. 



4.2.2 Joint Source and Channel Coding 

Unlike in the SSCC case where we have a nice geometric (contra-polymatroid) structure which 
enables us to reach a conclusion valid for any arbitrary asymmetric networks, JSCC in general lacks 
such a feature for us to exploit. We will instead , in this section, focus on a case with only L = 2 
sensor nodes and establish a similar result as in Section [4.2.11 for optimal channel decoding orders. 
WLOG, we assume gi > g2 in the subsequent analysis. 
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Theorem 5. Given a pair of quantization rates Ri and R2, when gi > §2, the optimal decoding 
order to minimize the total transmission power Pi + P2 is to decoder node 1 's signal first, and then 
node 2's information after removing the decoded node 1 's signal from the received signal, i.e. 

where p is the same as in [23 

Proof. See appendix [D1 □ 



R2 = I{U2; Z, Ui) = ^ log ( ) (^0) 



5 Uncoded Sensor Transmission in Fusion 

For Gaussian sensor networks as modeled in Section [21 it has been shown recently that uncoded 
transmission, i.e. each sensor only forwards a scaled version of its measurements to the fusion center, 
asymptotically achieves the lower bound on distortion when the number of sensors grow to infinity 
and system is symmetric. In the context of the theme of this paper, we, in this section, investigate 
the optimal power allocation strategy when a finite number of sensors deploy the uncoded scheme 
under more general channel conditions. 

For uncoded transmission, the transmitted signal by node j is Yj[{\ = ajXj[i]., where aj = 
■i J -i is a scaling factor to make the transmission power £'|yj [i]| = Pj. The received signal at the 



fusion center is therefore Z[i] = Yl,j=i "^^^ linear MMSE estimate of XQ[i] is: Xq[i 

jZli], where the coefficient 7 can be obtained using Orthogonal principle: E (^XQ[i] — XQ[i]^ Z[i] 
0. The resultant MSE is 



E 



Xo[i]-Xo[i] 



W 



(51) 



where B = '}2i=i ^i^j j=i Pi J V^idiV Pj9j the covariance coefficients pij is the same as in ((T]). 

2 



When L = 2, the power control problem under a distortion constraint E 



< D for 
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the uncoded scheme can be formulated as: 



mill Pi + P2 subject to: 



< - loe; 



a 



1^ ( 1 + - Pi9i 



+ 



N2 



P2g2 



1 + 



a 



w 



(52) 



This problem can again be transformed to a GP problem using the condensation technique applied 
in Section I4T1 We skip the details here. 

What deserves our attention is that when we compare the constraint in ([52]) with (f34l) . there is 
a striking similarity when we substitute rj with Ri, whose relationship was introduced in jH]). After 
substitution, (IMl) becomes 



1, \a 
- log 
2 ^ 



< - loe 
_ 2 & 



1 + 



Pigi + P2g2 + 2/0l,2VA5l-P252 



a 



w 



(53) 



where Aj 



-(2^^' — 1) for i = 1, 2, and pi^2 and pi^2 are associated as in 



For JSCC, we have Ri < I{Ui\ Y\U.j) + I{Ui; Uj), which is equivalent to 



1 



PI2 



< 



Pigi 



'W 



(54) 



We can infer from (f32l) - (f34ll . as well as ((521) that it is in general hard to argue which approach is the 
most energy efficient in terms of the total power consumption under a common distortion constraint 
D, which will be further exemplified in our simulation results in Section [7l There, we will see the 
most energy efficient approach depends on exact values of cj^ , as well as gi and D. 



However, when a system becomes homogeneous and symmetric in the sense that a 



a 



and Qi = Qj for all i,j £ {!,••• ,L}, we have consistent results for both finite number of L and 
asymptotically large L, as revealed in the next section, when we compare these three approaches. 



6 Energy Consumption Comparison for Homogeneous Networks: 
Finite and Asymptotic Results 

In this section, we provide analytical results on comparisons between different transmission strategies 
proposed thus far, including JSCC, SSCC and uncoded schemes in terms of their total transmission 
power consumptions when system becomes homogeneous. 
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6.1 Comparison under finite L and finite SNR 

Theorem 6. When a system of L < oo sensors becomes symmetric, the total power consumption 
for the separate, joint and uncoded schemes proposed previously follow: 

^] < (El) < (^] < (El) (55) 
^wJlob \^w)a \^w)j \^w)s 

where ( ) is the lower-bound on transmission power, and g is the common channel gain from 

\ "'w J LoB 

each sensor to the fusion center. Indexes A, J and S represent the uncoded, joint and separate 
encoding schemes, respectively. 

Proof. The proof hinges upon the analysis of rate and power allocations for symmetric networks for 
different schemes. 

Separate Coding: 

We first look at the separate source and channel coding approach. In symmetric networks, for 
source coding part, each node employs identical compression rate R, which satisfies LR = Lr + 

2 

i log ^ , where r is the solution to 

J_ + JL(i_2-2^) = 1. (56) 



As a result 




(57) 



For channel coding part, to minimize the total transmission power, it is optimal to let each sensor 
transmit at the same power P and same rate R which can be achieved by jointly decoding all node's 
information at the fusion center. Therefore, the total compression rate also satisfies 

LR =\\og( I + (58) 

Combining (fSTl) and ([581) yields 



1 al 



1 1\<^N- 



D ay L 



(59) 



Joint Coding: 

For the joint source-channel coding scheme, the vector quantization rate for each sensor is equal 
to i? = I{Xi; Ui), which is associated with r as shown by where r can be further obtained using 
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JM]). 

By applying the techniques used in driving the constraints in (f32]) - ([34|l to the L > 2 case when 
channel is symmetric, the quantization rate and the resultant L-user multiple access channel with L 
correlated inputs {C/i, • • • ,Ul} are associated by 



Lr + - log — 
2 ^ D 



liUi,--- ,Ul;Z) 
H{Z)-H{Z\Ui,--- ,Ul), 



(60) 



where the conditional entropy H{Z\Ui, • • • , Ul) = ^ log (^l-Keayy) , and the entropy of Z = \/9lUj+ 
W is H{Z) = \ log [27re (a^, + 5Var(7 E,- 



Given the covariance coefficient between C/j and 



(1 - 2^2i?)^ j^ave 



From (f56]l and JS]), we obtain 



E\-f^Uj\^ = LP + {L^ - L)pP 



(61) 



P 



D 



2 \ -1 
L + ^ 



1 



(62) 



The transmission power P for joint coded scheme in symmetric networks can thus be computed 
using (jeOl) and (IHTTl: 



'W / J 



.1 + ^(22-)^ 



l+[L--]p 



(63) 



Comparing (f63]l with ([59]) , it is apparent that ( ) < ( ) . 
Uncoded Scheme 

When sensor transmits scaled measurements in a symmetric network, we can obtain the minimum 
transmission power P by making the mean squared error obtained in ([ST]) equal to D and substituting 
Pj, Qj and 0"^, by P, g and cr^^, respectively. As a result, we obtain 



2 

J A 



D 



a 



ai + a 



^-1 
D 



(64) 



To compare ^^^r^^ with ^"^"^ ) we need to introduce an auxiliary variable. Define 



al + air 
Ql = 1- ^ o ^' p<l-p. 



a 



(65) 



s 
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We can therefore re-derive the minimum power for uncoded scheme: 



Pg 



a 



W/ A 



(66) 



In addition, we can express 2 



2r 



Ik 
L 



as 



i2r 



Ql 



> 1 



(67) 



which is used to transform ( ) as 



Pg 



w y J 




L-l 



1 

'Ql 



L-l + al/D 



{6i 



where the second term ^ /"^"^^^^ 
([68]) and (f66ll , it is straightforward to show ( I > 



j^^Tj^ — < 1 due to L > 1 and cr| > D. Since 1 — p > Ql, after comparing 

'^J A 



Lower Bound of Transmission Power 

Since Xq {^i, • • • , Xi} — > Z ^ Xq forms a Markov chain, by Data Processing Inequahty 
we have I{Xq;Xq) < I{Xi, • • • , Xl] Z). On one hand, to ensure E\Xq — XqP < D, it can be shown 



IiXo;Xo) > ^log using rate distortion results [Tj. On the other hand, the mutual information 
I{Xi, • • • , Xl; Z) is upper-bounded by the mutual information of an additive noise Gaussian channel 
with channel gain ^ and total transmission power upper- bounded by E\ ^jl^ Xj\^ = LP + (L^ - 



L)pP, where p 



is the covariance coefficient between Xi and Xj for i / j. Consequently, 



I(Xi,--- ,Xl;Z) < - log 



1 + ^{L + {L^-L)p) 



w 



(69) 



Prom I log ^ < I{Xq]Xq) < I{Xi,--- ,Xl;Z), we obtain the lower-bound of transmission 
power: 

(70) 



'W/LoB L+{L^-L)p 

Compare this lower bound with (l64l) . we have ( | < ( ^] . 

\<^wJloB K'^wJa 

Therefore, we have shown that the order in ([55]) holds and thus completed the proof. 



□ 
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6.2 Comparison under L ^ oo and finite SNR 

In this section, we provide the scahng behaviors of the total transmission power of various schemes 
studied so far. In particular, we are interested in how scales with respect to the number of 
sensors L in a symmetric system under a common constraint on distortion no greater than D. The 
analysis is quite straightforward based on the results for a finite L number of sensors that we have 
obtained in ([59]l . ([681) . (f66l) and (fTOl) . for separate, joint, uncoded schemes and the lower bound, 
respectively. 



Theorem 7. 



lim 

L^oo 



lim 



LgP 



a 



w / s 



^exp 



1 

D 



(LgP\ 








2 = exp 






"4). 



D 



lim 



w 



lim 



L^gP 



a 



w 



LoB 



(71) 



Proof. The proof of these convergence results is quite straightforward based upon the results for 
finite L as above, and is skipped here. 

□ 



It is obvious that the transmission power of the uncoded scheme shares the same scaling factor as 
the lower-bound, which is in the order of 1/L^. The asymptotic optimality of the uncoded scheme 
in symmetric Gaussian sensor networks is not a new result, which has been attained previously in 
[l| In [l|, the authors assumed a fixed transmission power and showed that the distortion 
achieved using uncoded approach has the same asymptotic scaling law as that obtained via a lower 
bound. Here, we provide a different perspective in assessing its optimality for the uncoded scheme 
in terms of the total transmission power while meeting a fixed distortion constraint. 

Both joint and separate coding schemes have the scaling factor in the order of 1/L. Asymptoti- 
cally, joint coding scheme saves in total transmission power by a factor of c'g/D as compared with 
the separated approach. 



6.3 Comparison under finite L and SNR — > 00 

Given the number of sensors L < 00, we are interested in the scaling factors associated with trans- 
mission SNR P/o"y(/, mean squared error D and measurement noise variance o"^ in homogeneous 
networks. In particular, we need to investigate how the following asymptotic factors are related, 
cr^ ^ 0, I) ^ and P/(t^ 00. 
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We first need to identify the limit imposed upon the scahng factor related with aj^ and D. As 
can be seen from both (f56l) and ((Ml) under SSCC, JSCC and uncoded approaches, it is required that 



'''' 



Denote 7* = \im^2^^Q-^. It can be seen that 7* has to satisfy 7* G [0, 1] in order to have the 
inequality in (f72l) hold. Consequently, 

A.= limAK)-/''' (73) 

I i 7* = 1 

Theorem 8. The asymptotic ratios associated with D, aj^ and P/cf^t have the following relationship: 



Proof. The proof follows straightforwardly with (f59l) . ([63]) and ([64ll for SSCC, JSCC and uncoded 



schemes, respectively, as we let cj^ — > and ^ LA*. □ 



We can see from (f74ll - ([76]) that when 7* G [0, 1), i.e. A* = 0, the JSCC and uncoded schemes 
have the same aysmptotic ratio between the received SNR and S-MSE-Ratio, which is smaller than 
that for the SSCC approach by a factor of 1/L. If 7* = 1, i.e. A* = uncoded approach has the 



smallest ratio among all schemes. However, if we introduce the SNR exponent as defined in |16l |. 



'I,- 



All three approaches share the same ratio 77 = 1 for finite L. 

Theorem [8] therefore provides us another perspective to compare these remote estimation ap- 
proaches. It demonstrates the proposed joint source and channel coding scheme has potentially the 
same asymptotic performance as the uncoded one in high channel SNR and high measurement SNR 
regions for all ratio exponents 7* € [0, 1). In addition, speaking of SNR exponent of distortion mea- 
sure in the high SNR region, all three schemes investigated in this paper share the same asymptotic 
ratio r] = 1. 

An additional remark we next make is about the limitation as to adopting SNR exponent r/ 



as a metric to characterize the asymptotic performance in large SNR regions (l6|]. It can be seen 



22 



clearly from the above analysis that the SNR exponent obscures the asymptotic difference between 
SSCC and JSCC, as well as the uncoded approaches, which have different linear ratios as shown in 
Theorem [HI These differences are gone once log-scale is imposed, however. 

Note also that we have implicitly assumed that the spectral efficiency of the system model in 
this paper, which is the ratio the source bandwidth over channel bandwidth is one. 



7 Numerical Results 



Source , , , Fusion Center 

^ ► d'i 



d 



L 



Figure 2: 1-D location Model 



In this section, the three approaches proposed in this paper are examined and compared with 
each other by looking at each of their optimal total transmission powers under the constraint of 
restoring Xq with MSE no greater than D, some prescribed threshold. Particularly, we consider 
a linear network topology where a source, fusion center and L = 2 sensor nodes are located on 
a same line as illustrated in Figure [2l To associate positions of sensor nodes with channel gains 
and measurement noise, we assume a path-loss model with coefficient Ps and /3c for gi and aj^_, 
respectively: gi oc l/(do — di)^" and aj^^ oc dj", for z = 1, • • • , L, where do is the distance between 
source and fusion center, and di is the distance between the i-th sensor and source. Given a distortion 
upper-bound D < and (3c = = the distance between the source and fusion center has to 

satisfy the following inequality, do < {LV^^ tt ^ , which is obtained by making the MSE 

I '^s J 

using {Xi} to estimate Xq no greater than D. 

We then run the geometrical programming based optimization algorithm to determine the min- 
imum total transmission powers for various approaches. We consider 9 spots uniformly distributed 
between the source and fusion center for possible locations of two sensors, which are indexed by 
integers 1 through 9. The smaller the index value is, the closer the sensor is located to the source. 
Figure[3{a), Figure[Hb) and Figure[3jc) demonstrate the total minimum power consumption P1 + P2 
as a function of nodes' locations for three sensor processing schemes, from which we have following 
observations: 



As proved in Theorem [HJ when network is symmetric, Ptotal A < Ptotal J < Pt 



total, S • 



Under a relatively large distortion constraint (e.g. D = 0.5), uncoded scheme is the most 
energy efficient among the three candidates for all sensor locations, as shown by Figure [3{a). 
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• Under relatively small distortion constraints (e.g. D = 0.1 and D = 0.01), separate coding 
approach becomes the most energy efficient when the relative position of two sensors becomes 
more asymmetric. For example, in both Figure [3fb) and Figure [3fc), at a location with an 
index pair (1,9), i.e. the first sensor is closest to the source and the second sensor is closest to 
the fusion center, we have Ptotai,A > Ptotai,j > Ptotai,s- 

• Overall, to minimize the total power expenditure, we should choose either uncoded transmission 
or separate coding scheme for a given pair of locations. This is a bit surprising as joint coded 
approach is often advocated more efficient (rate wise) than the separate one. It thus exemplifies 
that exact values of channel conditions and the level of measurement noise are crucial to 
concluding which scheme is the most power efficient in non-symmetric Gaussian networks with 
a finite number of sensors. 



A Proof of Theorem [2] 



Proof. From Theorem [U we know that each sensor finds from 2" ^ codewords the closest one 
U^^^ = {U^^\iW to the observation vector and then amplify- and-f or wards {^[i]} to the 

fusion center. The decoder applies jointly typical sequence decoding 7|| to seek {U^^\j = 1, • • • , L} 
from L codebooks which are jointly typical with the received vector 

WLOG, re-shuffie 2"-^J vectors such that f/j^^ is the vector selected by sensor j G {1, • • • , L}. We 
assume that a subset U^^^S") = {uf\i £ S^} has been decoded correctly, while its complementary 
set U^^\S) = {Uj^\j G S} is in error, which implies that the channel decoder at fusion center is 
in favor of a set of vectors U^''\S) = {U^^^\ kj 7^ 1, j G S}, instead. Next, We will find the upper 
bound of the probability that (^U^^\S^), and U^''\S) are jointly typical. 

The technique to upper-bond this probability is quite similar as the one for MAC channels with 
independent channel inputs 3, Chap 15.3]. The major difference here is that the channel inputs from 
L sensors are correlated because of the testing channel model used in independent source coding, i.e. 
Uj = Xj + Vj =Xo + Nj + Vj, for j = 1, • • • ,L. 

The upper bound of the probability that (U^^\S'^)., and U^^\S) are jointly typical is therefore 

2n{H{U{S),U{S-),Z)+e)2-n{H{U{S-),Z)-e)2-nY.i^s{H{U,)-e) ^^g^ 



exp2 -n 



\H{U{S^),Z) + Y,H{U,) 



i&S 

-H{U{S),U{S^),Z)-{\S\ + 2)e)) (79) 

where the first term in ([78]) is the upper bound for the number of jointly typical sequences of 
{U{S),U{S''),Z), the second term in ([78]) is the upper bound of the probability P (f/(^)(S"^), Z) and 
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the last term in ([78]) is the upper bound of the probabihty P {U^^\S)) . The summation in the last 
term in (f78]l is due to the independence of codebooks generated by each sensor and the assumption 
that decoder is in favor of some U^''''^ for kj / 1 and j G S, which are independent of C/^^)(5). 

Since we have at most 2"^i6s^j number of sequences to be confused with Uj^\j G S, we need 

^R,<H{U{S'),Z) 
jes 

+ H{Ui) - H{U{S), U{S'), Z) -{\S\ + 2)6 

ies 
\s\-i 

= J2l{u^^'UZ[)+imS);U{S%Z)-{\S\+2)e (80) 
1=1 

for all 5 C {!,••• , L} and any arbitrarily small e in order to achieve the asymptotic zero error 
probability as n ^ oo, which thus completes the proof. 

□ 



B Proof of Lemma [T] 

Proof. As 1/2^X2^X1^ Ui forms a Markov chain, we have 

I{U2;X2,Xi\Ui)^^ I{U2;Xi,X2,Ui)-I{U2;Ui) 
/(C/2; X2) + /(C/2; Xi, Ui\X2) - I{U2; Ui) 

'-^ I{U2;X2)-I{U2;Ui) (81) 

where equations (a) and (6) are due to the chain rule on conditional mutual information Given 
Markov chain of U2 X2 Xi ^ Ui, U2 and {Xi, Ui) are conditionally independent given X2 and 
consequently I{U2; Xi,Ui\X2) = leading to equation (c). 

On the other hand, following equations also hold under similar arguments: 

I{U2 ■,X2,Xi\Ui)^^ I{U2;X2\Ui) + I{U2;Xi\X2,Ui) 
I{U2;X2\Ui) + I{U2;Xi,Ui\X2) - /(C/2; C/1IX2) 

I{U2;X2\Ui) (82) 

Therefore, combining (fSTI) and ((82]) yields: 

I{U2;X2\Ui) = I{U2;X2) - I{Ui;U2) (83) 
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and similarly, 



/(^7i;Xi|C/2) = - /(C/i; C/2) 



(84) 



which thus proves (fT3l) and (fT4|l . 

We can prove (fTSl) by firstly showing that 



/(C/i , C/2 ; Xi , X2) = /(Xi , ; C/i ) + 1(^72 ; X2 , Xi I C/i ) 



(85) 



under the chain rule, where 



I{X^,X2;Ui) = I{X,;Ui) + I{X2;Ui\Xi) = I{Xr,Ui) 



(86) 



because of U2 X2 Xi ^ Ui. Since we have already proved ([8T]l . it is straightforward to show 
that (dH) holds. 



C Proof of Lemma [3] 

Proof. Due to the independence of measurement noise, U{S) — > X{S) — > X(5'^) U{S^) forms a 
Markov chain. The first equation in (f2T]) is a direct application of (fT3l) . 

The proof of the second equation is based upon another Markov chain by adding the source 
random variable Xq into the former one: U{S) — > X{S) — > Xq — > X{S^) U{S'^). Prom this 
Markov chain, we can deduce 



□ 



/ [U{S); Xo\U{S')] = I [U{S'),Xo; U{S)] - I [U{Sy, U{S)] 
= I [Xo; U{S)] + / [U{S')- U{S)\Xo] - I [C/(5); UiS')] 
= I[Xo;U{S)]-I[U{Sy,U{S')] 



(87) 



and 



Y,HUi;Xi\Xo]=I[U{Sy,XiS)\Xo] 



I [U{Sy,X{S)] + / [UiSy,Xo\X{S)] - I [Xo; U{S)] 
= I[U{SyxiS)]-I[Xo;U{S)] 



.8 



where the first equality is because of the conditional independence of {Xi, Ui) given Xq. 
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It can be seen that ([871) and (f88l) yields 

I [UiS);X,\U{S')] + J] / mx,\Xo] 

ies 

= I {U{S); X{S)) - I (C/(S); U{S')) , (89) 
which completes the proof for Lemma [31 

□ 

D Proof of Theorem [5] 

Proof. Define Zi = 2'^^' (l - p^), for i = 1,2. It can be shown that Zj > 1 by using = - 
2"2^i)(i - 2-2«2) and < p < 1 as shown in ©. 

Given a pair of quantization rates (i2i,i?2), the original optimization problem becomes 



minPi + P2, subject to: 



(1-/52)52 



(90) 



Define a matrix 



(91) 



91 PV9192 
Py/9l92 92 

whose eigenvalue decomposition is: A = QAQ-^, where the diagonal matrix A = diag{Ai,A2} has 
eigenvalues Aj of A and the column vectors of the matrix 



Q 



91,1 91,2 
92,1 92,2 



(92) 



are normalized eigenvectors associated with Ai and A2 respectively, which satisfy: 



Q^Q 



1 
1 



(93) 



Notice that the last constraint in ([90[1 is a quadratic form of variables yPi and VP2, which 
essentially determines an ellipse, we can perform an unitary transformation by introducing two new 
variables Yi and Y2: 

Y = [Yi,Y2V = Q^[/f\, ^/W (94) 
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such that the original optimization problem in (|90p is transformed to an equivalent one: 



min I |y I P = + if, subject to: 
\xYl + XiY^ > 63. 



(95) 



We show next that Y^ + if is minimized at the point where the second line 92, ill + 92,212 = ^2 
intersects with the ellipse Ailf + X2Y2 = as shown in Figured! In order to prove this, we need 
to first prove that the intersection of two lines gi,ili + (?t,2l2 = = 1,2 is inside the ellipse. Let 
[Y*, Y2] denote the crossing point of the two lines, which can be determined as [Yi*, I2I = Q^[6i, ^'2]^- 
It is sufficient to prove that Xi{Y^)'^ + A2(l"2*)^ < ^3, which is equivalent to having 



(96) 



This holds as 



w 



1-p 



n2 



[bi,b2]A[bi,b2f -b3 



V/(^1 - 1)(^2 - 1) 



P 



< 



(97) 



where Zi were defined right below (|50]) . and hence obtain the desired result. 

Assume Ai > A2. Solving quadratic function of IAI2 — A| = 0, we obtain eigenvalues Ai and A2: 



Ai,2 = ^(ffi +92) 



1 ± Vl - A 



where A = The entries of eigenvectors can be computed accordingly: 



92,1 



Ai - 92 



2Ai -91-92 

/ Ai - gi 
' 2Ai -91-92 



, Ql,2 



A2 - 92 



2X2 -91-92 



92,2 



A2 - 91 



2Ai -91-92 



{9i 



(99) 



Based on ([981) . we have 2Ai — 91—92 = — (2A2 — 91—92) = {91 + 5'2)Vl — A. Due to the 
non-negativeness of the ratios involved in ([99]) . it can be shown that Xi > gi > §2 > X2. In addition, 
because of Ai + A2 = gi + (72, 9i,j's satisfy: 



91,1 > |91,2|, 92,1 < 92,2- 



(100) 



We can therefore conclude that the lengths of the semi-axis l/\/Ai and l/\/A2 of the ellipse in the 
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direction of Yi and I2, respectively, satisfy l/\/Ai < l/\/A2- 

Also, since the slopes of the lines '?2,i^i +92,2^2 = &2 and gi,i^i + Q'i,2^2 = ^1 have the relationship 
of q2,i/q2,2 < 1 < '?i,i/|'i'i,2| , in addition, [17'5^2*] inside the ellipse, the minimum distance in (f95]l 
is attained at the point where the line with smaller slope intersects with the ellipse, as illustrated by 
Figure m which implies to minimize the total transmission power Pi + P2, the second constraint on 
P2 in (f90]l . as well as the third one, should be active. This is equivalent to having: R2 = /(C/2; Z, Ui) 
and Ri + R2 = I{Ui, U2; Z) + /(C/2; C/i), and i?i = I{Ui;Z). 

Consequently, when gi > 92, the decoding at the fusion center follows exactly as that described 
in Theorem [5l 

□ 

References 

[1] M .Gastpar and M. Vetterli, "On the capacity of large Gaussian relay networks," IEEE Trans- 
actions on Information Theory, vol. 51, no. 3, pp. 765-779, March 2005. 

[2] M .Gastpar and M. Vetterli, "Power, spatio-temporal bandwidth, and distortion in large sensor 
networks," IEEE Journal on Selected Areas in Communications, vol. 23, no. 4, pp. 745-754, 
April 2005. 

[3] M. Gastpar, "Uncoded transmission is exactly optimal for a simple Gaussian "sensor" network," 
in Information Theory and Applications Workshop, UCSD, Jan. 2007. 

[4] H. Behroozi and M. R. Soleymani, "Power-distortion performance of successive coding strategy 
in Gaussian CEO problem," in ICASSP, Nov. 2006, vol. 4, pp. IV-277-IV280. 

[5] Amos Lapidoth and Stephan Tinguely, "Sending a bi-variate Gaussian source over a Gaussian 
mac," in Proceedings International Symposium on Information Theory, 2006. 

[6] Y. Oohama, "Rate-distortion theory for Gaussian multiterminal source coding systems with 
several side information at the decoder," IEEE Transactions on Information Theory, vol. 51, 
no. 7, pp. 2577-2593, July 2005. 

[7] T. M. Cover and J. A. Thomas, Elements of Information Theory, Wiley, New York, 2006, 2nd 
Edition. 

[8] T. W. Anderson, An Introduction to Multivariate Statistical Analysis, Wiley Inter-Science, New 
York, 2003, 3rd Edition. 

[9] V. Prabhakaran, D. Tse, and K. Ramchandran, "Rate region of the quadratic Gaussian CEO 
problem," in Proceedings of ISIT, 2004. 



29 



[10] S. Boyd and L. Vandenberghe, Convex Optimization, Cambridge University Press, New York, 
2004. 

[11] Stephen P. Boyd, Seung Jean Kimy, Lieven Vandenberghez, and Arash Hassibix, "A tutorial on 
geometric programming," Optimization and Engineering, 2006, To appear. 

[12] M. Chiang, "Geometric programming for communication systems," Short monograph in Foun- 
dations and Trends in Communications and Information Theory, vol. 2, no. 1-2, pp. 1-154, July 
2005. 

[13] S. Wei, R. Kannan, S. Iyengar, and N. Rao, "Energy efficient data fusion in estimating Gaussian 
sources with sensor networks," Tech. Report, Dept. ECE LSU, 2007. 

[14] D. Tse and S. Hanly, "Multi-access fading channels: Part I: Polymatroid structure, optimal 
resource allocation and throughput capacities," IEEE Transactions on Information Theory, vol. 
44, no. 7, pp. 2796-2815, Nov. 1998. 

[15] M .Gastpar and M. Vetterli, "Source-channel communication in sensor networks," in Proc. IPSN 
2003, 2003. 

[16] G. Caire and K. Narayanan, "On the distortion snr exponent of hybrid digital- analog space-time 
coding," IEEE Transactions on Information Theory, vol. 53, no. 8, pp. 2867-2878, Aug. 2007. 



30 



^ Node 1 position 

Node 2 position, D=0.5 

(a) Result 1: D = 0.5, cj| = ci^ = 1, /3c = /3, = 2. 




(b) Results 2: D = 0.1, = a^^ = I, Pc = Ps = 2 




Node 1 position Node 2 position,D=0.01 



(c) Results 3: L> = 0.01, = = I, Pc = Ps = 2 



Figure 3: Total power consumption for separate source-channel coding (Red), joint source-channel 
coding (Blue) and uncoded (Black) schemes 
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Figure 4: Total power minimization 
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